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3 <110> APPLICANT: TAKEDA SAN DIEGO, INC. 

5 <120> TITLE OF INVENTION: CRYSTALLIZATION OF CATHEPSIN S 
7 <130> FILE REFERENCE: CATS-5002-C1 
9 <140> CURRENT APPLICATION NUMBER: 10/646 ,470A 
10 <141> CURRENT FILING DATE: 2003-08-22 

12 <150> PRIOR APPLICATION NUMBER: US 60/405,423 

13 <151> PRIOR FILING DATE: 2002-08-23 
15 <160> NUMBER OF SEQ ID NOS : 4 

17 <170> SOFTWARE: Patentln version 3.3 

19 <210> SEQ ID NO: 1 

20 <211> LENGTH: 331 

21 <212> TYPE: PRT 

22 <213> ORGANISM: Homo sapiens 

25 <220> FEATURE: 

26 <221> NAME /KEY: MI SC_FEATURE 

27 <222> LOCATION: (1) . . (331) 

28 <223> OTHER INFORMATION: Amino acid sequence for full-length human wild type Cathepsin 



30 


<300> PUBLICATION INFORMATION: 


















31 


<308> DATABASE ACCESSION NO: 


AF230097 














32 


<309> DATABASE ENTRY DATE: 2002- 


-04-08 














33 


<313> RELEVANT RESIDUES 


: (1) 


. . (331) 
















35 


<400> SEQUENCE: 


1 
























37 


Met 


Lys 


Arg 


Leu 


Val 


Cys 


Val 


Leu 


Leu 


Val 


Cys 


Ser 


Ser 


Ala 


Val 


Ala 


38 


1 








5 










10 










15 




41 


Gin 


Leu 


His 


Lys 


Asp 


Pro 


Thr 


Leu 


Asp 


His 


His 


Trp 


His 


Leu 


Trp 


Lys 


42 








20 










25 










30 






45 


Lys 


Thr 


Tyr 


Gly 


Lys 


Gin 


Tyr 


Lys 


Glu 


Lys 


Asn 


Glu 


Glu 


Ala 


Val 


Arg 


46 






35 










40 










45 








49 


Arg 


Leu 


He 


Trp 


Glu 


Lys 


Asn 


Leu 


Lys 


Phe 


Val 


Met 


Leu 


His 


Asn 


Leu 


50 




50 










55 










60 










53 


Glu 


His 


Ser 


Met 


Gly 


Met 


His 


Ser 


Tyr 


Asp 


Leu 


Gly 


Met 


Asn 


His 


Leu 


54 


65 










70 










75 










80 


57 


Gly 


Asp 


Met 


Thr 


Ser 


Glu 


Glu 


Val 


Met 


Ser 


Leu 


Met 


Ser 


Ser 


Leu 


Arg 


58 










85 










90 










95 




61 


Val 


Pro 


Ser 


Gin 


Trp 


Gin 


Arg Asn 


He 


Thr 


Tyr 


Lys 


Ser 


Asn 


Pro 


Asn 


62 








100 










105 










110 






65 


Arg 


He 


Leu 


Pro 


Asp 


Ser 


Val 


Asp 


Trp 


Arg 


Glu 


Lys 


Gly 


Cys 


Val 


Thr 


66 






115 










120 










125 








69 


Glu 


Val 


Lys 


Tyr 


Gin 


Gly 


Ser 


Cys 


Gly 


Ala 


Cys 


Trp 


Ala 


Phe 


Ser 


Ala 


70 




130 










135 










140 










73 


Val 


Gly 


Ala 


Leu 


Glu 


Ala 


Gin 


Leu 


Lys 


Leu 


Lys 


Thr 


Gly 


Lys 


Leu 


Val 


74 


145 










150 










155 










160 


77 


Ser 


Leu 


Ser 


Ala 


Gin 


Asn 


Leu 


Val 


Asp 


Cys 


Ser 


Thr 


Glu 


Lys 


Tyr 


Gly 
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78 165 










170 








175 




81 Asn Lys Gly Cys Asn 


Gly 


Gly 


Phe 


Met 


Thr 


Thr Ala 


Phe 


Gin 


Tyr 


He 


82 180 








185 








190 






85 lie Asp Asn Lys Gly 


He 


Asp 


Ser 


Asp 


Ala 


Ser Tyr 


Pro 


Tyr 


Lys Ala 


86 195 






200 








205 








89 Met Asp Leu Lys Cys 


Gin 


Tyr 


Asp 


Ser 


Lys 


Tyr Arg 


Ala 


Ala 


Thr 


Cys 


90 210 




215 








220 










93 Ser Lys Tyr Thr Glu 


Leu 


Pro Tyr Gly Arg Glu Asp Val 


Leu 


Lys 


Glu 


94 225 


230 










235 








240 


97 Ala Val Ala Asn Lys 


Gly 


Pro 


Val 


Ser 


Val 


Gly Val 


Asp Ala 


Arg 


His 


98 245 










250 








255 





101 Pro Ser Phe Phe Leu Tyr Arg Ser Gly Val Tyr Tyr Glu Pro Ser Cys 

102 260 265 270 

105 Thr Gin Asn Val Asn His Gly Val Leu Val Val Gly Tyr Gly Asp Leu 

106 275 280 285 

109 Asn Gly Lys Glu Tyr Trp Leu Val Lys Asn Ser Trp Gly His Asn Phe 

110 290 295 300 

113 Gly Glu Glu Gly Tyr He Arg Met Ala Arg Asn Lys Gly Asn His Cys 

114 305 310 315 320 

117 Gly He Ala Ser Phe Pro Ser Tyr Pro Glu He 

118 325 330 

121 <210> SEQ ID NO: 2 

122 <211> LENGTH: 996 

12 3 <212> TYPE: DNA 

124 <213> ORGANISM: Homo sapiens 

127 <220> FEATURE: 

128 <221> NAME/KEY: misc_feature 

129 <222> LOCATION: (1)..(996) 

130 <223> OTHER INFORMATION: Human cDNA sequence for Cathepsin S 

132 <400> SEQUENCE: 2 

133 atgaaacggc tggtttgtgt 
135 gatcctaccc tggatcacca 
137 gaaaagaatg aagaagcagt 

13 9 cttcacaacc tggagcattc 
141 ggagacatga ccagtgaaga 
143 tggcagagaa atatcacata 
145 tggagagaga aagggtgtgt 
147 gctttcagtg ctgtgggggc 
149 tctctcagtg cccagaacct 
151 aatggtggct tcatgacaac 
153 gacgcttcct atccctacaa 
155 gctgccacat gttcaaagta 
157 gctgtggcca ataaaggccc 
159 ctctacagaa gtggtgtcta 
161 cttgtggttg gctatggtga 
163 ggccacaact ttggtgaaga 
165 gggattgcta gctttccctc 

168 <210> SEQ ID NO: 3 

169 <211> LENGTH: 225 
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gctcttggtg tgctcctctg cagtggcaca gttgcataaa 60 

ctggcatctc tggaagaaaa cctatggcaa acaatacaag 12 0 

acgacgtctc atctgggaaa agaatctaaa gtttgtgatg 180 

aatgggaatg cactcatacg atctgggcat gaaccacctg 240 

agtgatgtct ttgatgagtt ccctgagagt tcccagccag 300 

taagtcaaac cctaatcgga tattgcctga ttctgtggac 360 

tactgaagtg aaatatcaag gttcttgtgg tgcttgctgg 420 

cctggaagca cagctgaagc tgaaaacagg aaagctggtg 480 

ggtggattgc tcaactgaaa aatatggaaa caaaggctgc 540 

ggctttccag tacatcattg ataacaaggg catcgactca 600 

agccatggat ctgaaatgtc aatatgactc aaaatatcgt 660 

cactgaactt ccttatggca gagaagatgt cctgaaagaa 720 

agtgtctgtt ggtgtagatg cgcgtcatcc ttctttcttc 780 

ctatgaacca tcctgtactc agaatgtgaa tcatggtgta 840 

tcttaatggg aaagaatact ggcttgtgaa aaacagctgg 900 

aggatatatt cggatggcaa gaaataaagg aaatcattgt 960 

ttacccagaa atctag 996 
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170 <212> TYPE: PRT 

171 <213> ORGANISM: Artificial 

173 <220> FEATURE: 

174 <223> OTHER INFORMATION: Amino acid sequence for resdues 114-331 of Cathepsin S with 



178 <220> FEATURE: 

179 <221> NAME / KEY : MISC_FEATURE 

180 <222> LOCATION: (1)..(218) 

181 <223> OTHER INFORMATION: Amino acid sequence for residues 114-331 of Cathepsin S 

183 <220> FEATURE: 

184 <221> NAME /KEY : MI SC_FEATURE 

185 <222> LOCATION: (219).. (225) 

186 <223> OTHER INFORMATION: Amino acid sequence for C-terminal Glycine-6x-histidine tag 
188 <400> SEQUENCE: 3 

190 lie Leu Pro Asp Ser Val Asp Trp Arg Glu Lys Gly Cys Val Thr Glu 

191 15 10 15 

194 Val Lys Tyr Gin Gly Ser Cys Gly Ala Cys Trp Ala Phe Ser Ala Val 

195 20 25 30 

198 Gly Ala Leu Glu Ala Gin Leu Lys Leu Lys Thr Gly Lys Leu Val Ser 

199 35 40 45 

202 Leu Ser Ala Gin Asn Leu Val Asp Cys Ser Thr Glu Lys Tyr Gly Asn 

203 50 55 60 

206 Lys Gly Cys Asn Gly Gly Phe Met Thr Thr Ala Phe Gin Tyr lie lie 

207 65 70 75 80 

210 Asp Asn Lys Gly lie Asp Ser Asp Ala Ser Tyr Pro Tyr Lys Ala Met 

211 85 90 95 

214 Asp Gin Lys Cys Gin Tyr Asp Ser Lys Tyr Arg Ala Ala Thr Cys Ser 

215 100 105 110 

218 Lys Tyr Thr Glu Leu Pro Tyr Gly Arg Glu Asp Val Leu Lys Glu Ala 

219 115 120 125 

222 Val Ala Asn Lys Gly Pro Val Ser Val Gly Val Asp Ala Arg His Pro 

223 130 135 140 

226 Ser Phe Phe Leu Tyr Arg Ser Gly Val Tyr Tyr Glu Pro Ser Cys Thr 

227 145 150 155 160 

230 Gin Asn Val Asn His Gly Val Leu Val Val Gly Tyr Gly Asp Leu Asn 

231 165 170 175 

234 Gly Lys Glu Tyr Trp Leu Val Lys Asn Ser Trp Gly His Asn Phe Gly 

235 180 185 190 

238 Glu Glu Gly Tyr lie Arg Met Ala Arg Asn Lys Gly Asn His Cys Gly 

239 195 200 205 

242 He Ala Ser Phe Pro Ser Tyr Pro Glu He Gly His His His His His 

243 210 215 220 

246 His 

247 225 

250 <210> SEQ ID NO: 4 

251 <211> LENGTH: 340 

252 <212> TYPE: PRT 

253 <213> ORGANISM: Artificial 
255 <220> FEATURE: 



a 



175 



C-terminal Glycine-6x-histidine tag 
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256 <223> OTHER INFORMATION: Amino acid sequence for Cathepsin S with an N-terminal 

257 Methionine- Proline and a C-terminal Glycine-6x-histidine tag 

260 <220> FEATURE: 

261 <221> NAME /KEY : MI SC_FEATURE 

262 <222> LOCATION: (1)..(2) 

263 <223> OTHER INFORMATION: Amino acid sequence for an N-terminal Methionine- Proline tag 

265 <220> FEATURE: 

266 <221> NAME/KEY: MISC_FEATURE 

267 <222> LOCATION: (3).. (333) 

268 <223> OTHER INFORMATION: Amino acid sequence for Cathepsin S 

270 <220> FEATURE: 

271 <221> NAME/KEY: MISC_FEATURE 

272 <222> LOCATION: (334) . . (340) 

273 <223> OTHER INFORMATION: Amino acid sequence for a C-terminal Glycine-6x-histidine 



2 75 


<400> SEQUENCE: 


4 
























Alt 


Met 


Pro 


Met 


Lys 


Arg 


T.nii 

U. 


val 


v_ y o 


Val 


Leu 


Leu 


Val 


Cys 


Ser 


Ser 


Ala 


2 78 


1 








5 










10 










15 




9 ft 1 
A. o X 


Val 


Ala 


Gin 


Leu 


n J. o 




Asp 


Pro 


Thr 


Leu 


Asp 


His 


His 


Trp 


His 


Leu 


282 








20 










25 










30 






2 85 


Trp 


Lys 


Lys 


Thr 


Tvr 

y 


Gly 


Lys 


Gin 


Tvr 

y 


Lys 


Glu 


Lys 


Asn 


Glu 


Glu 


Ala 


286 






35 










40 










45 








O -7 


Val 


Arg 


Arg 


Leu 


He 




Glu 


Lys 


Asn 


Leu 


Lys 


Phe 


Val 


Met 


Leu 


His 


290 




50 










55 










60 










293 


Asn 


Leu 


Glu 


His 


Ser 


Met 


Gly 


Met 


His 


Ser 


Tyr 


Asp 


Leu Gly 


Met 


Asn 


294 


65 










70 










75 










80 


297 


His 


Leu 


Gly Asp 


Met 


Thr 


Ser 


Glu 


Glu 


Val 


Met 


Ser 


Leu 


Met 


Ser 


Ser 


298 










85 










90 










95 




301 


Leu Arg 


Val 


Pro 


Ser 


Gin 


Trp 


Gin 


Arg 


Asn 


He 


Thr 


Tyr 


Lys 


Ser 


Asn 


302 








100 










105 










110 






305 


Pro 


Asn 


Arg 


He 


Leu 


Pro 


Asp 


Ser 


Val 


Asp 


Trp 


Arg 


Glu 


Lys 


Gly 


Cys 


306 






115 










120 










125 








309 


Val 


Thr 


Glu 


Val 


Lys 


Tyr 


Gin 


Gly 


Ser 


Cys 


Gly Ala Cys 


Trp 


Ala 


Phe 


310 




130 










135 










140 










313 


Ser 


Ala 


Val 


Gly 


Ala 


Leu 


Glu 


Ala 


Gin 


Leu 


Lys 


Leu 


Lys 


Thr 


Gly 


Lys 


314 


145 










150 










155 










160 


317 


Leu 


Val 


Ser 


Leu 


Ser 


Ala 


Gin 


Asn 


Leu 


Val 


Asp 


Cys 


Ser 


Thr 


Glu 


Lys 


318 










165 










170 










175 




321 


Tyr Gly 


Asn 


Lys 


Gly 


Cys 


Asn 


Gly 


Gly 


Phe 


Met 


Thr 


Thr 


Ala 


Phe 


Gin 


322 








180 










185 










190 






325 


Tyr 


He 


He Asp 


Asn 


Lys 


Gly 


He 


Asp 


Ser 


Asp Ala 


Ser 


Tyr 


Pro 


Tyr 


326 






195 










200 










205 








329 


Lys 


Ala 


Met 


Asp 


Gin 


Lys 


Cys 


Gin 


Tyr 


Asp 


Ser 


Lys 


Tyr Arg 


Ala 


Ala 


330 




210 










215 










220 










333 


Thr Cys 


Ser 


Lys 


Tyr 


Thr 


Glu 


Leu 


Pro 


Tyr 


Gly Arg Glu Asp 


Val 


Leu 


334 


225 










230 










235 










240 


337 


Lys 


Glu 


Ala 


Val 


Ala 


Asn 


Lys 


Gly 


Pro 


Val 


Ser 


Val 


Gly Val 


Asp 


Ala 


338 










245 










250 










255 




341 


Arg 


His 


Pro 


Ser 


Phe 


Phe 


Leu 


Tyr 


Arg 


Ser 


Gly Val 


Tyr 


Tyr 


Glu 


Pro 


342 








260 










265 










270 
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345 Ser Cys Thr Gin Asn Val Asn His Gly Val Leu Val Val Gly Tyr Gly 

346 275 280 285 

34 9 Asp Leu Asn Gly Lys Glu Tyr Trp Leu Val Lys Asn Ser Trp Gly His 
350 290 295 300 

353 Asn Phe Gly Glu Glu Gly Tyr lie Arg Met Ala Arg Asn Lys Gly Asn 

354 305 310 315 320 

357 His Cys Gly lie Ala Ser Phe Pro Ser Tyr Pro Glu lie Gly His His 

358 325 330 335 

361 His His His His 

362 340 
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Invalid <213> Response: 

Use of "Artificial" only as "<213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 

Seq# : 3 , 4 
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